We present a bouquet of continuity bounds for quantum entropies, falling broadly into two classes: 
First, a tight analysis of the Alicki-Fannes continuity bounds for the conditional von Neumann 
entropy, reaching almost the best possible form that depends only on the system dimension and the 
trace distance of the states. Almost the same proof can be used to derive similar continuity bounds 
for the relative entropy distance from a convex set of states or positive operators. As applications 
we give new proofs, with tighter bounds, of the asymptotic continuity of the relative entropy of 
entanglement, $E_R$, and its regularization $E_R^\infty$, as well as of the entanglement of formation, $E_F$. 
Using a novel “quantum coupling” of density operators, which may be of independent interest, we 
extend the latter to an asymptotic continuity bound for the regularized entanglement of formation, 
aka entanglement cost, $E_C = E_F^\infty$. 

Second, we derive analogous continuity bounds for the von Neumann entropy and conditional 
entropy in infinite dimensional systems under an energy constraint, most importantly systems of 
multiple quantum harmonic oscillators. While without an energy bound the entropy is discontinuous, 
it is well-known to be continuous on states of bounded energy. However, a quantitative statement 
to that effect seems not to have been known. Here, under some regularity assumptions on the 
Hamiltonian, we find that, quite intuitively, the Gibbs entropy at the given energy roughly takes 
the role of the Hilbert space dimension in the finite-dimensional Fannes inequality. 


I. INTRODUCTION 

On finite dimensional systems, the von Neumann entropy $S(\rho) = -\operatorname{Tr}\rho\log\rho$ is continuous, 
but this becomes useful only once one has explicit continuity bounds, most 
significantly the one due to Fannes fid ] , the sharpest form 
of which is the following: 

Lemma 1 (Audenaert Q, Petz [32]) For states $\rho$ and $\sigma$ on a Hilbert space $A$ of dimension 
$d = |A| < \infty$, if $\frac12\|\rho-\sigma\|_1 \le \epsilon \le 1$, then 

\[ |S(\rho) - S(\sigma)| \le \begin{cases} \epsilon\log(d-1) + h(\epsilon) & \text{if } \epsilon \le 1-\frac1d, \\ \log d & \text{if } \epsilon > 1-\frac1d, \end{cases} \]

with $h(x) = H(x, 1-x) = -x\log x - (1-x)\log(1-x)$ the 
binary entropy. A simplified, but universal bound reads 

\[ |S(\rho) - S(\sigma)| \le \epsilon\log d + h(\epsilon). \]

We include a short proof for self-containedness, and also 
because it deserves to be known better. It seems that 
it was first found by Petz [32, Thm. 3.8], who credits 
Csiszár for the classical case; the latter seems to have 
appeared first in Zhang’s paper [55] (see also 0). 

Proof. We only have to treat the case $\epsilon \le 1 - \frac1d$. We 
begin with the classical case of two probability distributions 
$p$ and $q$ on the same ground set of $d$ elements. It 
is well known, and in fact elementary to confirm, that 
one can find two jointly distributed random variables, 
$X \sim p$ and $Y \sim q$ (meaning $X$ is distributed according 
to the probability law $p$, and $Y$ according to $q$), with 
$\Pr\{X \neq Y\} = \frac12\|p-q\|_1 \le \epsilon$. The crucial idea is to 
let $\Pr\{X = Y = x\} = \min(p_x, q_x)$ and to distribute the 
remaining probability weight suitably off the diagonal. 
(This is also the minimum probability over all such coupled 
random variables [55]. For the reader with a taste 
for the sophisticated, this is the Kantorovich-Rubinshtein 
dual formula for the Wasserstein distance in the case of 
the trivial metric $d(x,y) = 1$ for all $x \neq y$ and $d(x,x) = 0$, 
cf. the broad survey [5|.) Then, by the monotonicity of 
the Shannon entropy under taking marginals and Fano’s 
inequality (see [lij), 

\[ H(X) - H(Y) \le H(XY) - H(Y) = H(X|Y) \le \epsilon\log(d-1) + h(\epsilon), \]

and likewise for $H(Y) - H(X)$. [For the simplified bound, 
we use $H(X|Y) \le \epsilon\log d + h(\epsilon)$.] 
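
The coupling used in the classical step can be illustrated numerically. The following is only a minimal sketch, not taken from the paper: the function names and the example distributions are illustrative assumptions, and the off-diagonal weight is distributed as a product of the normalized excesses, which is one of several valid choices.

```python
import numpy as np

def maximal_coupling(p, q):
    """Joint law of (X, Y) with marginals p, q and Pr{X != Y} = (1/2)||p - q||_1."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    overlap = np.minimum(p, q)          # Pr{X = Y = x} = min(p_x, q_x)
    eps = 1.0 - overlap.sum()           # equals (1/2)||p - q||_1
    joint = np.diag(overlap)
    if eps > 1e-15:
        # leftover weight goes off the diagonal (the two excesses have disjoint supports)
        joint += np.outer(p - overlap, q - overlap) / eps
    return joint, eps

def shannon(v):
    """Shannon entropy in bits of a (possibly multi-dimensional) probability array."""
    v = np.asarray(v, float).ravel()
    v = v[v > 0]
    return float(-(v * np.log2(v)).sum())

# Hypothetical example distributions on d = 3 elements.
p = np.array([0.5, 0.3, 0.2])
q = np.array([0.4, 0.4, 0.2])
joint, eps = maximal_coupling(p, q)     # rows index X, columns index Y

cond_entropy = shannon(joint) - shannon(q)                  # H(X|Y) = H(XY) - H(Y)
fano_bound = eps * np.log2(len(p) - 1) + shannon([eps, 1 - eps])
print(eps, cond_entropy, fano_bound)
```

In this example the conditional entropy of the coupled pair stays below the Fano bound, as the chain of inequalities above requires.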

Next, we reduce the quantum case to the classical one: 
W.l.o.g. $S(\rho) \le S(\sigma)$, and consider the dephasing operation 
$E$ in the eigenbasis of $\rho$, which maps $\rho$ to itself, a 
diagonal matrix with a probability distribution $p$ along 
the diagonal, and $\sigma$ to $E(\sigma)$, a diagonal matrix with a 
probability distribution $q$ along the diagonal. Hence 

\[ H(p) = S(\rho) \le S(\sigma) \le S(E(\sigma)) = H(q). \]

At the same time, $\|p-q\|_1 = \|E(\rho) - E(\sigma)\|_1 \le \|\rho-\sigma\|_1$, 
and so, using the classical case, 

\[ |S(\rho) - S(\sigma)| \le H(q) - H(p) \le \epsilon\log(d-1) + h(\epsilon). \]

Note that the inequality is tight for all $\epsilon$ and $d$, e.g. by 
$\sigma = |0\rangle\langle 0|$ and $\rho = (1-\epsilon)|0\rangle\langle 0| + \frac{\epsilon}{d-1}(1 - |0\rangle\langle 0|)$. □ 
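
As a quick sanity check of the tightness claim, the example states can be evaluated numerically. This is a sketch under stated assumptions: both states are diagonal in the same basis, so only their spectra are needed; the helper names and the values of `d` and `eps` are illustrative choices, and logarithms are taken to base 2.

```python
import numpy as np

def entropy_bits(spectrum):
    """Entropy in bits of a state given by its eigenvalues."""
    s = np.asarray(spectrum, float)
    s = s[s > 0]
    return float(-(s * np.log2(s)).sum())

def audenaert_bound(eps, d):
    """Right hand side of Lemma 1 (both cases)."""
    if eps > 1 - 1 / d:
        return float(np.log2(d))
    return float(eps * np.log2(d - 1) + entropy_bits([eps, 1 - eps]))

d, eps = 5, 0.3                                          # illustrative values
sigma_spec = np.array([1.0] + [0.0] * (d - 1))           # sigma = |0><0|
rho_spec = np.array([1 - eps] + [eps / (d - 1)] * (d - 1))

# Commuting states: the trace distance is the l1 distance of the spectra.
trace_distance = 0.5 * np.abs(rho_spec - sigma_spec).sum()
gap = entropy_bits(rho_spec) - entropy_bits(sigma_spec)
print(trace_distance, gap, audenaert_bound(eps, d))
```

For this pair the entropy difference coincides with the bound up to rounding.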
We are interested in bounds of the above form, i.e. only 
referring to the trace distance of the states and some general 
global parameter specifying the system, for a number 
of entropic quantities, starting with the conditional von 
Neumann entropy, relative entropy distances from certain 
sets, etc, which have numerous applications in quantum 
information theory and quantum statistical physics. Furthermore, 
and perhaps even more urgently, we seek such bounds in situations 
of infinite dimensional Hilbert spaces, where the above 
form of the Fannes inequality becomes trivial. 

The rest of the paper is structured as follows: in Section II 
we present and prove an almost tight version of 
Lemma 1 for the conditional entropy (originally due to 
Alicki and Fannes [1]), then in Section III we generalize 
the principle behind our proof to a family of relative 
entropy distance measures from a convex set; in these 
two sections we also present some illustrative applications 
of the conditional entropy bounds to two entanglement 
measures, $E_R$ and $E_F$, as well as their regularizations. 
In Section IV we expand the methodology of the 
first part of the paper to infinite dimensional systems, 
where Fannes-type continuity bounds are obtained under 
an energy constraint for a broad class of Hamiltonians, 
and specifically for quantum harmonic oscillators. All 
entropy continuity bounds are stated as Lemmas, while 
the applications appear as Corollaries, and two auxiliary 
results (on “quantum coupling” of density matrices) as 
Propositions. The absence of Theorems is meant to encourage 
readers to apply the results presented here. 


II. CONDITIONAL ENTROPY 

Alicki and Fannes [1] proved an extension of the Fannes 
inequality for the conditional entropy 

\[ S(A|B)_\rho = S(\rho^{AB}) - S(\rho^{B}), \]

defined for states $\rho$ on a bipartite (tensor product) 
Hilbert space $A \otimes B$. While a double application of 
Lemma 1 would yield such a bound involving both the 
dimensions of $A$ and $B$, Alicki and Fannes show that if 
$\frac12\|\rho-\sigma\|_1 \le \epsilon \le 1$, then 

\[ |S(A|B)_\rho - S(A|B)_\sigma| \le 4\epsilon\log|A| + 2h(\epsilon). \]

In particular, this form is independent of the dimension 
of $B$, which might even be infinite. Note that for classical, 
Shannon, conditional entropy, an inequality like the 
above can be obtained from Lemma 1 by convex combination, 
resulting in a bound like that of Lemma 2 (see 
below). 

The Alicki-Fannes inequality has several applications 
in quantum information theory, from the proof of asymptotic 
continuity of entanglement measures, most notably 
squashed entanglement [7] and conditional entanglement 
of mutual information (CEMI) [54], to the 
continuity of quantum channel capacities [26], and on to 
the recent discussion of approximately degradable channels 0. 

We present a simple proof of the Alicki-Fannes inequality 
that yields the stronger form of Lemma 2. One of the 
themes of the present paper, to which we draw attention 
here, is the use of entropy inequalities in the proofs. In 
particular, we make use of the concavity of the conditional 
entropy (which is equivalent to strong subadditivity 
of the von Neumann entropy) [27]. In the following 
proof we will specifically rely on two inequalities expressing 
the concavity of the entropy and the fact that it is 
not “too concave” 0: 

\[ \sum_i p_i S(\rho_i) \le S\Bigl(\sum_i p_i\rho_i\Bigr) \le \sum_i p_i S(\rho_i) + H(p). \qquad (1) \]

By introducing a bipartite state $\rho = \sum_i p_i \rho_i^A \otimes |i\rangle\langle i|^I$, 
this is seen to be equivalent to 

\[ S(A|I) \le S(A) \le S(AI) = S(A|I) + S(I), \]
which consists of two applications of strong subadditivity. 

Lemma 2 For states $\rho$ and $\sigma$ on a Hilbert space $A \otimes B$, 
if $\frac12\|\rho-\sigma\|_1 \le \epsilon \le 1$, then 

\[ |S(A|B)_\rho - S(A|B)_\sigma| \le 2\epsilon\log|A| + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right). \]

If $B$ is classical in the sense that both $\rho$ and $\sigma$ are 
so-called qc-states, i.e. with an orthonormal basis $\{|x\rangle\}$, 

\[ \rho = \sum_x p_x \rho_x^A \otimes |x\rangle\langle x|^B, \qquad \sigma = \sum_x q_x \sigma_x^A \otimes |x\rangle\langle x|^B, \]

and analogously if both are cq-states, then this can be 
tightened to 

\[ |S(A|B)_\rho - S(A|B)_\sigma| \le \epsilon\log|A| + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right). \]

Proof. The right hand side is monotonic in $\epsilon$, hence we 
may assume $\frac12\|\rho-\sigma\|_1 = \epsilon$. Let $\epsilon\Delta = (\rho-\sigma)_+$ be the 
positive part of $\rho-\sigma$. Note that because this difference 
is traceless and its trace norm equals $2\epsilon$, $\Delta$ is a bona fide 
state. Furthermore, 

\[ \rho = \sigma + (\rho-\sigma) \le \sigma + \epsilon\Delta = (1+\epsilon)\left(\frac{1}{1+\epsilon}\sigma + \frac{\epsilon}{1+\epsilon}\Delta\right) =: (1+\epsilon)\,\omega. \]

By letting $\epsilon\Delta' := (1+\epsilon)\omega - \rho$, we obtain another 
state $\Delta'$, such that 

\[ \omega = \frac{1}{1+\epsilon}\sigma + \frac{\epsilon}{1+\epsilon}\Delta = \frac{1}{1+\epsilon}\rho + \frac{\epsilon}{1+\epsilon}\Delta'. \qquad (2) \]
This is a slightly optimized version of the trick in the 
proof of Alicki and Fannes [1]; cf. [28]. 

Now, we use the following well-known variational characterization 
of the conditional entropy: 

\[ -S(A|B)_\omega = \min_\xi D(\omega^{AB} \| 1^A \otimes \xi^B), \]

where $D(\rho\|\sigma) = \operatorname{Tr}\rho(\log\rho - \log\sigma)$ is the quantum relative 
entropy [30, 46]. Choosing an optimal state $\xi$ for $\omega$ 
(which is $\xi = \omega^B$), we have, from Eq. (2), 

\[ \begin{aligned}
S(A|B)_\omega &= -D(\omega^{AB} \| 1^A \otimes \xi^B) \\
 &= S(\omega) + \operatorname{Tr}\omega\log\xi^B \\
 &\le h\!\left(\frac{\epsilon}{1+\epsilon}\right) + \frac{1}{1+\epsilon}S(\rho) + \frac{\epsilon}{1+\epsilon}S(\Delta')
      + \frac{1}{1+\epsilon}\operatorname{Tr}\rho\log\xi^B + \frac{\epsilon}{1+\epsilon}\operatorname{Tr}\Delta'\log\xi^B \\
 &= h\!\left(\frac{\epsilon}{1+\epsilon}\right) - \frac{1}{1+\epsilon}D(\rho\|1\otimes\xi) - \frac{\epsilon}{1+\epsilon}D(\Delta'\|1\otimes\xi) \\
 &\le h\!\left(\frac{\epsilon}{1+\epsilon}\right) + \frac{1}{1+\epsilon}S(A|B)_\rho + \frac{\epsilon}{1+\epsilon}S(A|B)_{\Delta'},
\end{aligned} \]

where in the third line we have used the concavity upper 
bound from Eq. (1). Using the other decomposition in 
Eq. (2), the concavity of the conditional entropy, i.e. the 
lower bound in Eq. (1), gives 

\[ S(A|B)_\omega \ge \frac{1}{1+\epsilon}S(A|B)_\sigma + \frac{\epsilon}{1+\epsilon}S(A|B)_\Delta. \]

Putting these two bounds together and multiplying by 
$1+\epsilon$, we arrive at 

\[ S(A|B)_\sigma - S(A|B)_\rho \le \epsilon\bigl(S(A|B)_{\Delta'} - S(A|B)_\Delta\bigr) + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right). \]

The proof of the general bound is concluded observing 
that the conditional entropy of any state is bounded between 
$-\log|A|$ and $+\log|A|$. 

For the case of two qc-states or two cq-states as above, 
note that the states $\Delta$ and $\Delta'$ are of the same, qc-form 
(cq-form, resp.), and so their conditional entropies are 
between $0$ and $\log|A|$. □ 

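Remark 3 Lemma 2 is almost best possible, as we can 
see by considering the example of $\sigma^{AB} = \Phi_d$, the maximally 
entangled state on $A = B = \mathbb{C}^d$, and 
$\rho^{AB} = (1-\epsilon)\Phi_d + \frac{\epsilon}{d^2-1}(1 - \Phi_d)$. Clearly, $\frac12\|\rho-\sigma\|_1 = \epsilon$, while 

\[ \begin{aligned}
S(A|B)_\rho - S(A|B)_\sigma &= \bigl(\epsilon\log(d^2-1) + h(\epsilon) - \log d\bigr) - (-\log d) \\
 &= 2\epsilon\log d + h(\epsilon) - O\!\left(\frac{\epsilon}{d^2}\right).
\end{aligned} \]

This asymptotically matches Lemma 2 for large $d$ and 
small $\epsilon$. 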
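
The example of Remark 3 can also be checked numerically against the bound of Lemma 2. The sketch below is illustrative only: the helper names, the base-2 logarithm, and the choice $d = 4$, $\epsilon = 0.05$ are assumptions made here, not part of the paper.

```python
import numpy as np

def entropy_bits(rho):
    """Von Neumann entropy in bits of a density matrix."""
    evals = np.linalg.eigvalsh(rho)
    evals = evals[evals > 1e-12]
    return float(-(evals * np.log2(evals)).sum())

def conditional_entropy(rho_ab, d_a, d_b):
    """S(A|B) = S(AB) - S(B) for a state on A (x) B (A is the first tensor factor)."""
    rho_b = np.trace(rho_ab.reshape(d_a, d_b, d_a, d_b), axis1=0, axis2=2)
    return entropy_bits(rho_ab) - entropy_bits(rho_b)

def h(x):
    """Binary entropy in bits."""
    return entropy_bits(np.diag([x, 1.0 - x]))

d, eps = 4, 0.05                                  # illustrative values
phi_vec = np.eye(d).reshape(d * d) / np.sqrt(d)   # maximally entangled vector
Phi = np.outer(phi_vec, phi_vec)

sigma = Phi
rho = (1 - eps) * Phi + eps / (d**2 - 1) * (np.eye(d * d) - Phi)

trace_distance = 0.5 * np.abs(np.linalg.eigvalsh(rho - sigma)).sum()
gap = conditional_entropy(rho, d, d) - conditional_entropy(sigma, d, d)
lemma2_bound = 2 * eps * np.log2(d) + (1 + eps) * h(eps / (1 + eps))
print(trace_distance, gap, lemma2_bound)
```

For these values the gap is close to, but below, the bound, in line with the "almost best possible" statement.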


As an application of Lemma 2 we can obtain tighter 
continuity bounds on various quantum channel capacities, 
simply substituting our tighter bound rather than 
the original formulation of Alicki and Fannes in the proofs 
of Leung and Smith [26]. 

As a token, we demonstrate a tight version of the 
asymptotic continuity of the entanglement of formation [J|, 

\[ E_F(\rho) = \min \sum_x p_x S(\operatorname{Tr}_B \rho_x) \quad \text{s.t.} \quad \rho = \sum_x p_x \rho_x \]

for a state $\rho^{AB}$ on the bipartite system $A \otimes B$, originally 
due to Nielsen [29]. We then go on to prove asymptotic 
continuity for its regularization, the entanglement 
cost 0, 

\[ E_C(\rho) = E_F^\infty(\rho) = \lim_{n\to\infty} \frac1n E_F(\rho^{\otimes n}), \]

which, albeit following the general “telescoping” strategy 
of [26], requires a new idea, and seems not to have been 
known before @]. Note that $E_C$ is different from $E_F$ [15]. 

Corollary 4 Let $\rho$ and $\sigma$ be states on the system $A \otimes B$, 
denoting the smaller of the two dimensions by $d$. Then, 
$\frac12\|\rho-\sigma\|_1 \le \epsilon$ implies, with $\delta = \sqrt{\epsilon(2-\epsilon)}$, 

\[ |E_F(\rho) - E_F(\sigma)| \le \delta\log d + (1+\delta)\,h\!\left(\frac{\delta}{1+\delta}\right), \]

\[ |E_C(\rho) - E_C(\sigma)| \le 2\delta\log d + (1+\delta)\,h\!\left(\frac{\delta}{1+\delta}\right). \]

Note that these bounds only depend on the smaller of 
the two dimensions, in contrast to [29]; in particular, 
they apply even in the case that one of the two Hilbert 
spaces is infinite dimensional. 

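Proof. We may assume w.l.o.g. that $E_F(\rho) \ge E_F(\sigma)$ 
and $|B| \ge |A| = d$. Choose a purifying system $R \simeq AB$, 
and pure states $\varphi^{ABR}$ and $\psi^{ABR}$ with $\varphi^{AB} = \rho$ and 
$\psi^{AB} = \sigma$ such that 

\[ |\langle\varphi|\psi\rangle| = F(\rho,\sigma) \ge 1-\epsilon, \]

thus $\frac12\|\varphi-\psi\|_1 \le \delta = \sqrt{1-(1-\epsilon)^2}$. Here, $F(\rho,\sigma) = \|\sqrt{\rho}\sqrt{\sigma}\|_1$ 
is the fidelity between two quantum states, and 
we have used that it is related to the trace distance by 
these well-known inequalities 0: 

\[ 1 - F(\rho,\sigma) \le \tfrac12\|\rho-\sigma\|_1 \le \sqrt{1 - F(\rho,\sigma)^2}. \qquad (3) \]

By an observation of Schrödinger (which he called 
“steering”) in the context of his investigation of quantum 
entanglement @, cf. [22], for any convex decomposition 
$\sigma = \sum_x p_x\sigma_x$, there exists a measurement POVM $(M_x)$ 
on $R$ such that $p_x\sigma_x = \operatorname{Tr}_R\,\psi(1^{AB} \otimes M_x^R)$. Introducing 
the qc-channel $\mathcal{M}(\xi) = \sum_x \operatorname{Tr}\xi M_x\,|x\rangle\langle x|$ from $R$ to a 
suitable space $X$, we then have 

\[ \tilde\sigma := (\mathrm{id}_{AB} \otimes \mathcal{M})\psi = \sum_x p_x\sigma_x^{AB} \otimes |x\rangle\langle x|^X, \qquad (4) \]

and $S(A|X)_{\tilde\sigma} = \sum_x p_x S(\operatorname{Tr}_B\sigma_x)$. 

Let us choose an optimal decomposition for the purpose 
of entanglement of formation, and the corresponding 
POVM and quantum channel, i.e. $E_F(\sigma) = S(A|X)_{\tilde\sigma}$. 
Applying the same to $\varphi^{ABR}$, we obtain 

\[ \tilde\rho := (\mathrm{id}_{AB} \otimes \mathcal{M})\varphi = \sum_x q_x\rho_x^{AB} \otimes |x\rangle\langle x|^X, \]

with $q_x = \operatorname{Tr}\varphi^R M_x$. Hence, 

\[ E_F(\rho) \le \sum_x q_x S(\operatorname{Tr}_B\rho_x) = S(A|X)_{\tilde\rho}. \]

Observe that by the contractivity of the trace norm under 
cptp maps, 

\[ \delta \ge \tfrac12\|\varphi-\psi\|_1 \ge \tfrac12\|\tilde\sigma-\tilde\rho\|_1. \]
Now we can invoke the classical part of Lemma 2: 

\[ E_F(\rho) - E_F(\sigma) \le S(A|X)_{\tilde\rho} - S(A|X)_{\tilde\sigma} \le \delta\log d + (1+\delta)\,h\!\left(\frac{\delta}{1+\delta}\right), \]

and we are done. 

For the regularization, consider any integer $n$ and 

\[ \begin{aligned}
E_F(\rho^{\otimes n}) - E_F(\sigma^{\otimes n})
 &= \sum_{t=1}^n E_F(\rho^{\otimes t} \otimes \sigma^{\otimes n-t}) - E_F(\rho^{\otimes t-1} \otimes \sigma^{\otimes n-t+1}) \\
 &\le \sum_{t=1}^n |E_F(\rho \otimes \Omega_t) - E_F(\sigma \otimes \Omega_t)|,
\end{aligned} \qquad (5) \]

with $\Omega_t = \rho^{\otimes t-1} \otimes \sigma^{\otimes n-t}$. The proof will be concluded 
by showing that for any $\Omega$, 

\[ |E_F(\rho \otimes \Omega) - E_F(\sigma \otimes \Omega)| \le 2\delta\log d + (1+\delta)\,h\!\left(\frac{\delta}{1+\delta}\right), \]

as this will imply from Eq. (5) that 

\[ \frac1n\left|E_F(\rho^{\otimes n}) - E_F(\sigma^{\otimes n})\right| \le 2\delta\log d + (1+\delta)\,h\!\left(\frac{\delta}{1+\delta}\right). \]

To see this, assume again w.l.o.g. that $E_F(\rho \otimes \Omega) \ge E_F(\sigma \otimes \Omega)$, 
and choose a purification $\nu$ of $\Omega$ on $A'B'R'$, 
with $R' \simeq A'B'$. Besides the purification $\psi^{ABR}$ of $\sigma$, 
we now need a state (not generally pure) $\Theta^{ABR}$ with 
$\Theta^{AB} = \rho$ and $\Theta^R = \psi^R$. Proposition 5 below guarantees 
the existence of such a state with $F(\psi,\Theta) \ge 1-\epsilon$, hence 
$\frac12\|\psi-\Theta\|_1 \le \delta$, once more invoking Eq. (3). As before 
we choose an optimal decomposition of $\sigma^{AB} \otimes \Omega^{A'B'}$ into 
states on $AA' : BB'$, which we can represent by a POVM 
and associated cptp map $\mathcal{M} : RR' \to X$: 

\[ \tilde\sigma := (\mathrm{id}_{AA'BB'} \otimes \mathcal{M})(\psi \otimes \nu) = \sum_x p_x\sigma_x^{AA'BB'} \otimes |x\rangle\langle x|^X, \]

\[ E_F(\sigma \otimes \Omega) = S(AA'|X)_{\tilde\sigma} = \sum_x p_x S(\operatorname{Tr}_{BB'}\sigma_x). \]

Applying the same map to $\Theta \otimes \nu$, we get 

\[ \tilde\rho := (\mathrm{id}_{AA'BB'} \otimes \mathcal{M})(\Theta \otimes \nu) = \sum_x p_x\rho_x^{AA'BB'} \otimes |x\rangle\langle x|^X, \]

\[ E_F(\rho \otimes \Omega) \le S(AA'|X)_{\tilde\rho} = \sum_x p_x S(\operatorname{Tr}_{BB'}\rho_x), \]

where we observe that, crucially, the same $p_x$ appear in 
the expressions for $\tilde\rho$ and $\tilde\sigma$. Using $\Theta^R = \sigma^T = \psi^R$, we 
even have 

\[ \tilde\rho^{A'B'X} = (\mathrm{id}_{A'B'} \otimes \mathcal{M})(\sigma^T \otimes \nu) = \tilde\sigma^{A'B'X}. \]

Thus with Lemma 2, as desired, 

\[ \begin{aligned}
E_F(\rho \otimes \Omega) - E_F(\sigma \otimes \Omega) &\le S(AA'|X)_{\tilde\rho} - S(AA'|X)_{\tilde\sigma} \\
 &= S(A|A'X)_{\tilde\rho} - S(A|A'X)_{\tilde\sigma} \\
 &\le 2\delta\log d + (1+\delta)\,h\!\left(\frac{\delta}{1+\delta}\right),
\end{aligned} \]

where in the second line we have used the chain rule 
$S(AA'|X) = S(A'|X) + S(A|A'X)$, as well as $S(A'|X)_{\tilde\rho} = S(A'|X)_{\tilde\sigma}$. □ 

Proposition 5 (“Quantum coupling”) Given states 
$\rho$ and $\sigma$ on a Hilbert space $A$, with $\frac12\|\rho-\sigma\|_1 \le \epsilon$, 
there exist purifications $|\varphi\rangle$ of $\rho$ and $|\psi\rangle$ of $\sigma$, and a 
(sub-normalized) vector $|\vartheta\rangle$, all three in the tensor square 
Hilbert space $A \otimes A =: A_1A_2$, such that 

\[ \rho^T = \varphi^{A_2}, \quad \rho = \varphi^{A_1} \ge \operatorname{Tr}_{A_2}|\vartheta\rangle\langle\vartheta|, \]
\[ \sigma = \psi^{A_1}, \quad \sigma^T = \psi^{A_2} \ge \operatorname{Tr}_{A_1}|\vartheta\rangle\langle\vartheta|, \]

and 

\[ |\langle\psi|\vartheta\rangle|,\ |\langle\varphi|\vartheta\rangle| \ge 1-\epsilon. \]

Here, $\cdot^T$ denotes the transpose of a matrix with respect 
to a chosen basis. 

Consequently, there exists a state $\Theta$ with the properties 
$\Theta^{A_1} = \rho$ and $\Theta^{A_2} = \psi^{A_2} = \sigma^T$, and such that 
$F(\psi,\Theta),\ F(\varphi,\Theta) \ge 1-\epsilon$. 

This proposition can be viewed as a quantum analogue 
of the coupling of random variables $X \sim p$ and $Y \sim q$ 
such that $\Pr\{X \neq Y\} = \frac12\|p-q\|_1$, on which the proof 
of Lemma 1 relied. 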
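Proof. Fixing an orthonormal basis $\{|i\rangle\}$ of $A$, and introducing 
the unnormalized maximally entangled vector 

\[ |\Phi\rangle = \sum_i |i\rangle^{A_1}|i\rangle^{A_2}, \]

we have the following two “pretty good purifications” [53] 
of $\rho$ and $\sigma$: 

\[ |\varphi\rangle := (\sqrt{\rho} \otimes 1)|\Phi\rangle = \bigl(1 \otimes \sqrt{\rho^T}\bigr)|\Phi\rangle, \]
\[ |\psi\rangle := (\sqrt{\sigma} \otimes 1)|\Phi\rangle = \bigl(1 \otimes \sqrt{\sigma^T}\bigr)|\Phi\rangle, \]

the claimed properties of which can be readily checked. 

To obtain $|\vartheta\rangle$, we use once more Eq. (2) from the proof 
of Lemma 2, 

\[ \omega = \frac{1}{1+\epsilon}\sigma + \frac{\epsilon}{1+\epsilon}\Delta = \frac{1}{1+\epsilon}\rho + \frac{\epsilon}{1+\epsilon}\Delta', \]

with states $\Delta$ and $\Delta'$. Then define 

\[ \begin{aligned}
|\vartheta\rangle &:= \frac{1}{\sqrt{1+\epsilon}}\bigl(\rho^{1/2}\omega^{-1/2}\sigma^{1/2} \otimes 1\bigr)|\Phi\rangle \\
 &= \bigl(X \otimes \sqrt{\sigma^T}\bigr)|\Phi\rangle = (X \otimes 1)|\psi\rangle \\
 &= \bigl(\sqrt{\rho} \otimes Y\bigr)|\Phi\rangle = (1 \otimes Y)|\varphi\rangle,
\end{aligned} \]
using $(Z \otimes 1)|\Phi\rangle = (1 \otimes Z^T)|\Phi\rangle$, with 

\[ X = \frac{1}{\sqrt{1+\epsilon}}\,\rho^{1/2}\omega^{-1/2}, \qquad Y = \frac{1}{\sqrt{1+\epsilon}}\,(\sigma^T)^{1/2}(\omega^T)^{-1/2}. \]

We claim that $\|X\|, \|Y\| \le 1$. Indeed, $\omega \ge \frac{1}{1+\epsilon}\rho$, so 

\[ XX^\dagger = \frac{1}{1+\epsilon}\sqrt{\rho}\,\omega^{-1}\sqrt{\rho} \le \frac{1}{1+\epsilon}\sqrt{\rho}\,\bigl[(1+\epsilon)\rho^{-1}\bigr]\sqrt{\rho} = 1, \]

and similarly $YY^\dagger \le 1$. From this it follows that 

\[ \vartheta^{A_2} = \operatorname{Tr}_{A_1}(X^\dagger X \otimes 1)\psi \le \psi^{A_2} = \sigma^T, \quad\text{and} \]

\[ \vartheta^{A_1} = \operatorname{Tr}_{A_2}(1 \otimes Y^\dagger Y)\varphi \le \varphi^{A_1} = \rho. \]

It remains to bound the inner product $|\langle\psi|\vartheta\rangle|$ (the 
other one, $|\langle\varphi|\vartheta\rangle|$, is completely analogous): 

\[ \begin{aligned}
|\langle\psi|\vartheta\rangle| &= \frac{1}{\sqrt{1+\epsilon}}\left|\langle\Phi|\bigl(\sigma^{1/2}\rho^{1/2}\omega^{-1/2}\sigma^{1/2} \otimes 1\bigr)|\Phi\rangle\right| \\
 &= \frac{1}{\sqrt{1+\epsilon}}\left|\operatorname{Tr}\sqrt{\rho}\,\omega^{-1/2}\sigma\right| \\
 &= \frac{1}{\sqrt{1+\epsilon}}\left|\operatorname{Tr}\sqrt{\rho}\,\omega^{-1/2}\bigl[(1+\epsilon)\omega - \epsilon\Delta\bigr]\right| \\
 &= \frac{1}{\sqrt{1+\epsilon}}\left|(1+\epsilon)\operatorname{Tr}\sqrt{\rho}\sqrt{\omega} - \epsilon\operatorname{Tr}\sqrt{\rho}\,\omega^{-1/2}\Delta\right| \\
 &\ge \operatorname{Tr}\sqrt{\rho}\sqrt{(1+\epsilon)\omega} - \epsilon\,|\operatorname{Tr}X\Delta| \\
 &\ge \operatorname{Tr}\sqrt{\rho}\sqrt{\rho} - \epsilon\,\|X\|\,\|\Delta\|_1 \ge 1-\epsilon,
\end{aligned} \]

where we have first used the definitions of $|\psi\rangle$, $|\vartheta\rangle$ and 
$|\Phi\rangle$, and then the identity between $\omega$ and $\sigma$; the fifth line 
is by triangle inequality, in the sixth we used $(1+\epsilon)\omega \ge \rho$ 
once more, the operator monotonicity of the square root, 
and the Hölder inequality $|\operatorname{Tr}X\Delta| \le \|X\|\,\|\Delta\|_1$; in the 
last step we use the fact that both $\rho$ and $\Delta$ are states 
and $\|X\| \le 1$. 

Finally, to obtain $\Theta$, we write 

\[ \rho = |\vartheta\rangle\langle\vartheta|^{A_1} + (1 - \langle\vartheta|\vartheta\rangle)\Delta_1, \]
\[ \sigma^T = |\vartheta\rangle\langle\vartheta|^{A_2} + (1 - \langle\vartheta|\vartheta\rangle)\Delta_2, \]

with bona fide states $\Delta_1$ and $\Delta_2$. It is straightforward 
to check that the definition 

\[ \Theta := |\vartheta\rangle\langle\vartheta| + (1 - \langle\vartheta|\vartheta\rangle)\,\Delta_1 \otimes \Delta_2 \]

satisfies all requirements on $\Theta$. □ 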
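
The construction in this proof can be traced numerically for small random states. The following sketch is only an illustration under stated assumptions: the helpers `mpow` and `random_state`, the random seed and the dimension are choices made here, and full-rank states are used so that the inverse square root of $\omega$ exists.

```python
import numpy as np

def mpow(m, power):
    """Power of a positive definite Hermitian matrix via its eigendecomposition."""
    evals, vecs = np.linalg.eigh(m)
    return (vecs * evals**power) @ vecs.conj().T

def random_state(d, rng):
    """Random full-rank density matrix (illustrative helper)."""
    g = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    rho = g @ g.conj().T
    return rho / np.trace(rho).real

rng = np.random.default_rng(0)
d = 3
rho, sigma = random_state(d, rng), random_state(d, rng)

# eps * Delta = (rho - sigma)_+ and omega as in Eq. (2)
evals, vecs = np.linalg.eigh(rho - sigma)
pos_part = (vecs * np.clip(evals, 0, None)) @ vecs.conj().T
eps = np.trace(pos_part).real                   # equals (1/2)||rho - sigma||_1
omega = (sigma + pos_part) / (1 + eps)

Phi = np.eye(d).reshape(d * d)                  # unnormalized sum_i |i>|i>
one = np.eye(d)
phi = np.kron(mpow(rho, 0.5), one) @ Phi        # purification of rho
psi = np.kron(mpow(sigma, 0.5), one) @ Phi      # purification of sigma
theta = np.kron(mpow(rho, 0.5) @ mpow(omega, -0.5) @ mpow(sigma, 0.5), one) @ Phi
theta = theta / np.sqrt(1 + eps)                # sub-normalized vector

overlap_psi = abs(np.vdot(psi, theta))
overlap_phi = abs(np.vdot(phi, theta))
T = theta.reshape(d, d)                         # T[i1, i2]: amplitudes on A1, A2
theta_a1 = T @ T.conj().T                       # reduced operator on A1
print(eps, overlap_psi, overlap_phi)
```

Both overlaps should come out at least $1-\epsilon$, and the reduced operator of $|\vartheta\rangle$ on $A_1$ should be dominated by $\rho$.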


Remark 6 Although the above proof refers to the unnormalized 
vector $|\Phi\rangle$, and thus taken literally only 
makes sense for finite dimensional Hilbert spaces, the 
proposition remains true also in the infinite dimensional 
(separable) case. This can be seen either by finite dimensional 
approximation, or by considering $|\Phi\rangle$ as a formal 
device to mediate between normalized entangled vectors 
($|\varphi\rangle$, $|\psi\rangle$, $|\vartheta\rangle$, etc) and Hilbert-Schmidt class operators 
($\sqrt{\rho}$, $\sqrt{\sigma}$, $\rho^{1/2}\omega^{-1/2}\sigma^{1/2}$, etc). 

III. RELATIVE ENTROPY DISTANCES 

The same method employed in Lemma 2 can be used to 
derive asymptotic continuity bounds for the relative entropy 
distance with respect to any closed convex set $C$ of 
states, or more generally positive semidefinite operators, 
on a Hilbert space $A$ (cf. [45]), 

\[ D_C(\rho) = \min_{\gamma\in C} D(\rho\|\gamma). \qquad (6) \]

Unlike [HI], $C$ has to contain only at least one full-rank 
state, so that $D_C$ is guaranteed to be finite; in addition, 
$C$ should be bounded, so that $D_C$ is bounded from below. 
We recover the conditional entropy $S(A|B)_\rho$ for a 
bipartite state $\rho$ on $A \otimes B$, as $-D_C(\rho)$ with 

\[ C = \{1^A \otimes \sigma^B : \sigma \text{ a state on } B\}. \]

Lemma 7 For a closed, convex and bounded set $C$ of 
positive semidefinite operators, containing at least one of 
full rank, let 

\[ \kappa := \sup_{\tau,\tau'} D_C(\tau) - D_C(\tau') \]

be the largest variation of $D_C$. Then, for any two states 
$\rho$ and $\sigma$ with $\frac12\|\rho-\sigma\|_1 \le \epsilon$, 

\[ |D_C(\rho) - D_C(\sigma)| \le \epsilon\kappa + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right). \qquad (7) \]
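Proof. The only modification with respect to the proof 
of Lemma 2 is that we replace the invocation of concavity 
of the conditional entropy with the joint convexity of the 
relative entropy, which makes $D_C$ a convex functional. 
Namely, with $\omega$ as in Eq. (2), we have on the one hand, 

\[ D_C(\omega) \le \frac{1}{1+\epsilon}D_C(\sigma) + \frac{\epsilon}{1+\epsilon}D_C(\Delta). \]

On the other hand, with an optimal $\gamma \in C$, 

\[ \begin{aligned}
D_C(\omega) &= D(\omega\|\gamma) \\
 &= -S(\omega) - \operatorname{Tr}\omega\log\gamma \\
 &\ge -h\!\left(\frac{\epsilon}{1+\epsilon}\right) - \frac{1}{1+\epsilon}S(\rho) - \frac{\epsilon}{1+\epsilon}S(\Delta')
      - \frac{1}{1+\epsilon}\operatorname{Tr}\rho\log\gamma - \frac{\epsilon}{1+\epsilon}\operatorname{Tr}\Delta'\log\gamma \\
 &= -h\!\left(\frac{\epsilon}{1+\epsilon}\right) + \frac{1}{1+\epsilon}D(\rho\|\gamma) + \frac{\epsilon}{1+\epsilon}D(\Delta'\|\gamma) \\
 &\ge -h\!\left(\frac{\epsilon}{1+\epsilon}\right) + \frac{1}{1+\epsilon}D_C(\rho) + \frac{\epsilon}{1+\epsilon}D_C(\Delta').
\end{aligned} \]

Putting these two inequalities together yields the claim 
of the lemma. □ 

In particular, in the case that 

\[ C = \mathrm{SEP}(A : B) := \operatorname{conv}\{\alpha^A \otimes \beta^B : \alpha, \beta \text{ states on } A, B, \text{ resp.}\} \]

is the set of separable states, we obtain the relative entropy 
of entanglement of a state $\rho$ on the bipartite system 
$A \otimes B$, $E_R(\rho) = D_{\mathrm{SEP}(A:B)}(\rho)$ E3. Furthermore, we 
consider its regularization 

\[ E_R^\infty(\rho) = \lim_{n\to\infty} \frac1n E_R(\rho^{\otimes n}), \]

which is known to be different from $E_R(\rho)$ in general [48]. 

Corollary 8 (Cf. Donald/Horodecki )lll| & Christandl @]) 
For any two states $\rho$ and $\sigma$ on the composite 
system $A \otimes B$, denoting the smaller of the dimensions 
$|A|$, $|B|$ by $d$, $\frac12\|\rho-\sigma\|_1 \le \epsilon$ implies 

\[ |E_R(\rho) - E_R(\sigma)| \le \epsilon\log d + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right), \]

\[ |E_R^\infty(\rho) - E_R^\infty(\sigma)| \le \epsilon\log d + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right). \]

Note that this bound only depends on the smaller of the 
two dimensions, in contrast to 0; in particular, it applies 
even in the case that one of the two Hilbert spaces 
is infinite dimensional. 

Proof. The first bound, on the single-letter $E_R$, is a 
direct application of Lemma 7 to the case where $C$ is the 
set of all separable states on $A \otimes B$. 

For the regularization, consider any integer $n$ and 

\[ \begin{aligned}
E_R(\rho^{\otimes n}) - E_R(\sigma^{\otimes n})
 &= \sum_{t=1}^n E_R(\rho^{\otimes t} \otimes \sigma^{\otimes n-t}) - E_R(\rho^{\otimes t-1} \otimes \sigma^{\otimes n-t+1}) \\
 &\le \sum_{t=1}^n |E_R(\rho \otimes \Omega_t) - E_R(\sigma \otimes \Omega_t)|,
\end{aligned} \]

with $\Omega_t = \rho^{\otimes t-1} \otimes \sigma^{\otimes n-t}$. Now for each $t$, Lemma 7 gives 

\[ |E_R(\rho \otimes \Omega_t) - E_R(\sigma \otimes \Omega_t)| \le \epsilon\kappa_t + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right), \]

with $\kappa_t = \sup_{\tau,\tau'}\bigl(E_R(\tau \otimes \Omega_t) - E_R(\tau' \otimes \Omega_t)\bigr)$. To see 
this, we have to look into the proof of the lemma, and 
observe that for states $\rho \otimes \Omega_t$ and $\sigma \otimes \Omega_t$, also the auxiliary 
operators $\Delta$ and $\Delta'$ are of the form $\tau \otimes \Omega_t$ and $\tau' \otimes \Omega_t$. 
However, by LOCC monotonicity, 

\[ E_R(\tau' \otimes \Omega_t) \ge E_R(\Omega_t), \]

and similarly 

\[ E_R(\tau \otimes \Omega_t) \le E_R(\Phi_d \otimes \Omega_t) \le \log d + E_R(\Omega_t), \]

so that $\kappa_t \le \log d$. Although we do not need it, the right 
hand inequality is in fact an equality, $E_R(\Phi_d \otimes \Omega_t) = \log d + E_R(\Omega_t)$ (HI. 
Thus, we obtain for all $n$, 

\[ \left|\frac1n E_R(\rho^{\otimes n}) - \frac1n E_R(\sigma^{\otimes n})\right| \le \epsilon\log d + (1+\epsilon)\,h\!\left(\frac{\epsilon}{1+\epsilon}\right), \]

and taking the limit $n \to \infty$ concludes the proof. □ 

Again, in Lemma 7 and Corollary 8 the constant in the 
linear term (proportional to $\epsilon$) is essentially best possible, 
as we see by taking two states maximizing the difference 
$D_C(\rho) - D_C(\sigma)$, i.e. attaining $\kappa$, since $\frac12\|\rho-\sigma\|_1 \le 1 =: \epsilon$. 
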


Remark 9 Lemma[7]improves upon similar-looking gen¬ 
eral bounds by Synak-Radtke and Horodecki j45j, which 
were subsequently optimized by Mosonyi and Hiai [28], 
Prop. VI. 1]. The latter paper also explains lucidly (in 
Sec. VI) that the coefficient j-A in the convex decompo¬ 
sition of w in two ways, into p and A' and into a and A, is 
optimal, and gives a nice geometric interpretation of u> as 
a max-relative entropy center of p and a (cf. 0 ). Thus, 
at least following the same strategy one cannot improve 
the bound any more. 

That the regularized relative entropy measure $E_R^\infty$ is 
asymptotically continuous followed previously from its 
non-lockability [21], which it inherits from $E_R$. This has 
been worked out in [§, Prop. 13], following [1, Prop. 3.23], 
with a different linear term. 

Remark 10 It would be interesting to lift the restriction 
that $C$ has to be a convex set: Natural examples are 
the case that $C$ is the set of all product states in a bipartite 
(multipartite) system, in which case $D_C$ becomes the 
quantum mutual information (multi-information); or the 
case that $C$ is the closure of the set of all Gibbs states 
for a suitable Hamiltonian operator $H$, 

\[ C = \overline{\left\{\frac{1}{\operatorname{Tr}e^{-\beta H}}\,e^{-\beta H} : \beta > 0\right\}}. \]

Both examples have in common that $C$ is an exponential 
family (or the closure of one); it is known that at least 
in some cases $D_C$ is continuous, but counterexamples of 
discontinuous behaviour are known [51]. 

IV. BOUNDED ENERGY 


If the Hilbert space in the Fannes inequality (Lemma 1) 
has infinite dimension, or likewise $A$ in the Alicki-Fannes 
inequality (Lemma 2), then the bound becomes trivial: 
the right hand side is infinite. This is completely 
natural, since the entropy is not even continuous, and 
these Fannes-type bounds imply a sort of uniform continuity. 
Continuity is restored, however, when restricting 
to states of finite energy, for instance of a quantum harmonic 
oscillator [5d|, see also [12] and [H] for more recent 
results and excellent surveys on the status of continuity 
of the entropy. Shirokov [39] has developed an approach 
to prove (local) continuity of entropic quantities, based 
on certain finite entropy assumptions, in which he uses 
Alicki-Fannes inequalities on finite approximations. 

Uniform bounds are still out of the question, but what 
we shall show here is that the Fannes and Alicki-Fannes 
inequalities discussed above have satisfying analogues, 
with a dependence on the energy of the states rather 
than the Hilbert space dimension. 

Abstractly, our setting is this: Consider a Hamiltonian 
$H$ on an infinite dimensional separable Hilbert space 
$A$. If there is another system $B$ and we consider bipartite 
states and conditional entropy, we implicitly assume 
trivial Hamiltonian on $B$, i.e. global Hamiltonian 
$H = H^A \otimes 1^B$. We shall need a number of assumptions 
on $H$, to start with that it has discrete spectrum and 
that it is bounded from below; for normalization purposes 
we fix the ground state energy of $H$ to be 0. The 
mathematically precise assumption is the following. 

Gibbs Hypothesis. For every $\beta > 0$, let the partition 
function $Z(\beta) := \operatorname{Tr}e^{-\beta H} < \infty$ be finite, so that 
$\frac{1}{Z(\beta)}e^{-\beta H}$ is a bona fide state with finite entropy. In 
this case, for every energy $E$ in the spectrum of $H$, 
the (unique) maximizer of the entropy $S(\rho)$ subject to 
$\operatorname{Tr}\rho H \le E$ is of the Gibbs form: 

\[ \gamma(E) = \frac{1}{Z(\beta(E))}\,e^{-\beta(E)H}, \]

where $\beta = \beta(E)$ is decreasing with $E$ and is the solution 
to the equation 

\[ \operatorname{Tr}e^{-\beta H}(H - E) = 0. \]

The entropy in this case is given by 

\[ S(\gamma(E)) = \log Z + \beta(E)(\log e)E. \]

This implies that the spectrum is unbounded above, and 
that the energy levels cannot become “too dense” with 
growing energy value. 

Let us immediately draw some conclusions from these 
assumptions; the following is a simple consequence of Shirokov’s 
[37, Prop. 1], for which we present an elementary 
proof. 

Proposition 11 For a Hamiltonian $H$ satisfying the 
Gibbs Hypothesis, $S(\gamma(E))$ is a strictly increasing, 
strictly concave function of the energy $E$. 


Proof. It is clear from the maximum entropy characterization 
of $\gamma(E)$ that the entropy as a function of $E$ 
must be non-decreasing; it is unbounded by looking at 
the formula for the entropy in terms of $\log Z$. 

Furthermore, for energies $E_1$ and $E_2$, and $0 \le p \le 1$, 

\[ \operatorname{Tr}\bigl(p\gamma(E_1) + (1-p)\gamma(E_2)\bigr)H \le pE_1 + (1-p)E_2 =: E, \]

and so concavity follows: 

\[ \begin{aligned}
S(\gamma(E)) &\ge S\bigl(p\gamma(E_1) + (1-p)\gamma(E_2)\bigr) \\
 &\ge pS(\gamma(E_1)) + (1-p)S(\gamma(E_2)).
\end{aligned} \qquad (8) \]

From this it follows that $S(\gamma(E))$ is strictly increasing, 
because otherwise $S(\gamma(E_1)) = S(\gamma(E_2))$ for some 
$E_1 < E_2$, but then $S(\gamma(E_2)) < S(\gamma(E_3))$ for some 
$E_2 < E_3$, since the entropy grows to infinity as $E \to \infty$, 
contradicting concavity. 

But this means that for $E_1 \neq E_2$, necessarily $\gamma(E_1) \neq \gamma(E_2)$, 
and so by the strict concavity of the von Neumann 
entropy, we have strict inequality in the second line of 
Eq. (8) for $0 < p < 1$. □ 

Corollary 12 If $H$ satisfies the Gibbs Hypothesis, then 
for any $\delta > 0$, 

\[ \sup_{0<\lambda\le\delta} \lambda S(\gamma(E/\lambda)) = \delta S(\gamma(E/\delta)). \]

Proof. The right hand side is clearly attained by letting 
$\lambda = \delta$. To prove “$\le$” for any admissible $\lambda$, observe that 
by concavity (Proposition 11), 

\[ S(\gamma(tF)) \ge t\,S(\gamma(F)) + (1-t)\,S(\gamma(0)) \ge t\,S(\gamma(F)). \]

Letting $t = \frac{\lambda}{\delta} \le 1$ and $F = E/\lambda$ concludes the proof. □ 


Remark 13 Another useful fact proved by Shirokov [37, 
Prop. 1 (ii)], which we shall invoke later, is that under 
our assumptions, $S(\gamma(E)) = o(E)$, which can be recast 
as saying that $\delta\,S(\gamma(E/\delta)) \to 0$ for every finite $E$ and 
$\delta \to 0$. □ 
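
For a concrete feel of Corollary 12 and Remark 13, one can take a single harmonic oscillator. The sketch below rests on assumptions not stated in the text above: units are chosen so that the level spacing is 1 and the ground energy is 0, in which case the Gibbs state with mean energy $N$ has the well-known entropy $(N+1)\log(N+1) - N\log N$; the function name and the numerical values are illustrative.

```python
import numpy as np

def gibbs_entropy(energy):
    """Entropy in bits of the single-mode oscillator Gibbs state with mean
    energy `energy` > 0 (level spacing 1, ground energy 0: an assumed model)."""
    n = np.asarray(energy, float)
    return (n + 1) * np.log2(n + 1) - n * np.log2(n)

E, delta = 2.0, 0.25                       # illustrative values
lams = np.linspace(1e-3, delta, 200)       # admissible 0 < lambda <= delta
values = lams * gibbs_entropy(E / lams)    # lambda * S(gamma(E / lambda))

sup_value = values.max()
endpoint_value = delta * gibbs_entropy(E / delta)
print(values[0], sup_value, endpoint_value)
```

On this grid the expression increases with $\lambda$, so the supremum sits at $\lambda = \delta$, and the value at the smallest $\lambda$ is already close to zero.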





We start with an easy-to-prove continuity bound for 
the entropy, inspired by the proof of Lemma 1, though 
for the conditional entropy we shall have to resort to a 
different argument. It uses a quantum coupling as in 
Proposition 5 (which implies a weaker bound in the following, 
with the square of the expression on the right 
hand side). 

Proposition 14 Let $\rho$ and $\sigma$ be states on the same 
Hilbert space $A$, and consider the tensor square $A \otimes A =: A_1A_2$ 
of the quantum system. Then, there exists a state 
$\omega$ with $\omega^{A_1} = \rho$, $\omega^{A_2} = \sigma$ and such that 

\[ \|\omega\|_\infty \ge 1 - \tfrac12\|\rho-\sigma\|_1. \]

Proof. Choose spectral decompositions 

\[ \rho = \sum_i r_i |e_i\rangle\langle e_i|, \qquad \sigma = \sum_i s_i |f_i\rangle\langle f_i|, \]

of the two states, with $r_1 \ge r_2 \ge \ldots$ and $s_1 \ge s_2 \ge \ldots$; 
then, the $\ell^1$-distance between the probability vectors $(r_i)$ 
and $(s_i)$ is not larger than the trace distance between $\rho$ 
and $\sigma$: 

\[ \|\rho-\sigma\|_1 \ge \|(r_i) - (s_i)\|_1 =: 2\epsilon. \]

(This is known as Mirsky’s inequality [20, Cor. 7.4.9.3].) 
Defining a vector 

\[ |\phi\rangle := \sum_i \sqrt{\min\{r_i, s_i\}}\;|e_i\rangle^{A_1}|f_i\rangle^{A_2} \]

in $A_1A_2$, we clearly have $\operatorname{Tr}|\phi\rangle\langle\phi| = 1-\epsilon$, and $\phi^{A_1} \le \rho$, 
$\phi^{A_2} \le \sigma$, thus we can write 

\[ \rho = |\phi\rangle\langle\phi|^{A_1} + \epsilon\Delta_1, \qquad \sigma = |\phi\rangle\langle\phi|^{A_2} + \epsilon\Delta_2, \]

with bona fide states $\Delta_1$ and $\Delta_2$. 

It is straightforward to check that the definition 
$\omega := |\phi\rangle\langle\phi| + \epsilon\,\Delta_1 \otimes \Delta_2$ satisfies all requirements on $\omega$. □ 
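
The coupling state of Proposition 14 is easy to build explicitly for small matrices. The following is a minimal sketch, assuming random full-rank states with distinct spectra; the helper `random_state`, the seed and the dimension are illustrative choices, not part of the paper.

```python
import numpy as np

def random_state(d, rng):
    """Random full-rank density matrix (illustrative helper)."""
    g = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    rho = g @ g.conj().T
    return rho / np.trace(rho).real

rng = np.random.default_rng(1)
d = 3
rho, sigma = random_state(d, rng), random_state(d, rng)

# Spectral decompositions with eigenvalues sorted in decreasing order.
r, e = np.linalg.eigh(rho)
s, f = np.linalg.eigh(sigma)
r, e, s, f = r[::-1], e[:, ::-1], s[::-1], f[:, ::-1]

m = np.minimum(r, s)
eps = 1.0 - m.sum()                        # (1/2) * l1 distance of the spectra
phi = sum(np.sqrt(m[i]) * np.kron(e[:, i], f[:, i]) for i in range(d))

delta1 = (e * (r - m)) @ e.conj().T / eps  # rho   = phi^{A1} + eps * Delta_1
delta2 = (f * (s - m)) @ f.conj().T / eps  # sigma = phi^{A2} + eps * Delta_2
omega = np.outer(phi, phi.conj()) + eps * np.kron(delta1, delta2)

omega4 = omega.reshape(d, d, d, d)
marginal_a1 = np.trace(omega4, axis1=1, axis2=3)
marginal_a2 = np.trace(omega4, axis1=0, axis2=2)
top_eigenvalue = np.linalg.eigvalsh(omega)[-1]
trace_distance = 0.5 * np.abs(np.linalg.eigvalsh(rho - sigma)).sum()
print(eps, trace_distance, top_eigenvalue)
```

The marginals reproduce $\rho$ and $\sigma$, and the largest eigenvalue of $\omega$ is at least $1 - \frac12\|\rho-\sigma\|_1$, consistent with the statement of the proposition.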

Lemma 15 Let the Hamiltonian $H$ on $A$ satisfy the 
Gibbs Hypothesis. Then for any two states $\rho$ and $\sigma$ on $A$ 
with $\operatorname{Tr}\rho H, \operatorname{Tr}\sigma H \le E$ and $\frac12\|\rho-\sigma\|_1 \le \epsilon \le 1$, 

\[ |S(\rho) - S(\sigma)| \le 2\epsilon S(\gamma(E/\epsilon)) + h(\epsilon). \]

Proof. Pick a state $\omega$ on $A_1A_2$ according to Proposition 14, 
i.e. $\omega^{A_1} = \rho$, $\omega^{A_2} = \sigma$, and with largest eigenvalue 
$\ge 1-\epsilon$, meaning that we can write 

\[ \omega = (1-\epsilon)|\psi\rangle\langle\psi| + \epsilon\,\omega', \]

with a pure state $|\psi\rangle$ (the normalized vector $|\phi\rangle$ from the 
proof of Proposition 14) and some other state $\omega'$. Hence, 

\[ \begin{aligned}
|S(\rho) - S(\sigma)| &= |S(\omega^{A_1}) - S(\omega^{A_2})| \\
 &\le S(\omega^{A_1A_2}) \\
 &\le \epsilon S(\omega') + h(\epsilon) \\
 &\le 2\epsilon S(\gamma(E/\epsilon)) + h(\epsilon).
\end{aligned} \]

Here, we have first used the marginals of $\omega$, then in the 
second line the Araki-Lieb “triangle” inequality, in 
the third line strong subadditivity, and in the last step 
the maximum entropy principle, noting that with respect 
to the Hamiltonian $H^{A_1} \otimes 1^{A_2} + 1^{A_1} \otimes H^{A_2}$, $\omega$ has energy 
$\le 2E$, and so the energy of $\omega'$ is bounded by $2E/\epsilon$. For 
the last line, observe that the Gibbs state at energy $2E/\epsilon$ 
of the composite system is $\gamma(E/\epsilon)^{\otimes 2}$. □ 

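The following two general bounds lack perhaps the simple 
elegance of Lemma 15, but they turn out to be more 
flexible, and stronger in certain regimes. 

Meta-Lemma 16 (Entropy) For a Hamiltonian $H$ on 
$A$ satisfying the Gibbs Hypothesis and any two states $\rho$ 
and $\sigma$ with $\operatorname{Tr}\rho H, \operatorname{Tr}\sigma H \le E$, $\frac12\|\rho-\sigma\|_1 \le \epsilon < \epsilon' \le 1$, 
and $\delta = \frac{\epsilon'-\epsilon}{1+\epsilon'}$, 

\[ |S(\rho) - S(\sigma)| \le (\epsilon' + 2\delta)\,S(\gamma(E/\delta)) + h(\epsilon') + h(\delta). \]

Meta-Lemma 17 (Conditional entropy) For states 
$\rho$ and $\sigma$ on the bipartite system $A \otimes B$ and otherwise 
the same assumption as before, 

\[ |S(A|B)_\rho - S(A|B)_\sigma| \le (2\epsilon' + 4\delta)\,S(\gamma(E/\delta)) + (1+\epsilon')\,h\!\left(\frac{\epsilon'}{1+\epsilon'}\right) + 2h(\delta). \]

To interpret these bounds, we remark that in a certain 
sense they show that the Gibbs entropy at the cutoff 
energy $E/\epsilon$ ($E/\delta$) takes on the role of the logarithm of 
the dimension in the finite dimensional case. Before we 
launch into their proof, let us introduce some notation: 
Define the energy cutoff projectors 

\[ P_\le := \sum_{0 \le E_n \le E/\delta} |n\rangle\langle n|, \qquad P_> := 1 - P_\le, \]

where $|n\rangle$ is the eigenvector of eigenvalue $E_n$ of the 
Hamiltonian $H$. We shall also consider the pinching map 

\[ \mathcal{T}(\xi) = P_\le\,\xi\,P_\le + P_>\,\xi\,P_>, \]

which is a unital channel, as well as its action on the 
original $\rho$ and $\sigma$: 

\[ \mathcal{T}(\rho) =: (1-\lambda)\rho_\le + \lambda\rho_>, \]

\[ \mathcal{T}(\sigma) =: (1-\mu)\sigma_\le + \mu\sigma_>. \]

Note that because $H$ commutes with the action of $\mathcal{T}$, 
we have $\operatorname{Tr}\xi H = \operatorname{Tr}\mathcal{T}(\xi)H$, and so the energy bound $E$ 
applies also to $\mathcal{T}(\rho)$ and $\mathcal{T}(\sigma)$. Hence, 

\[ \lambda \le \delta, \quad \lambda\operatorname{Tr}\rho_> H \le E, \quad \mu \le \delta, \quad \mu\operatorname{Tr}\sigma_> H \le E. \qquad (9) \]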

Our strategy will be to relate $S(\rho)$ to $S(\rho_<)$ (and the
same for $\sigma$ and $\sigma_<$) via entropy inequalities, including
concavity, similar to the first part of the paper, and then
apply the usual Fannes (Alicki-Fannes) inequalities to $\rho_<$
and $\sigma_<$.
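The effect of the energy cutoff can be illustrated with a small numerical sketch. It assumes, for simplicity only, a state that is diagonal in the energy eigenbasis of a single mode with levels $E_n = n$, so that the pinching map acts trivially and entropies reduce to Shannon entropies. The populations and the value of $\delta$ are hypothetical.

```python
import math


def shannon(p):
    """Shannon entropy in bits of a probability vector."""
    return -sum(x * math.log2(x) for x in p if x > 0.0)


def h(x):
    """Binary entropy in bits."""
    return shannon([x, 1.0 - x])


def cutoff_split(p, energies, cutoff):
    """Return (lam, p_low, p_high): the weight above the cutoff and the
    two normalized pieces of the diagonal state p."""
    lam = sum(x for x, e in zip(p, energies) if e > cutoff)
    p_low = [x / (1.0 - lam) if e <= cutoff else 0.0
             for x, e in zip(p, energies)]
    p_high = [x / lam if (e > cutoff and lam > 0.0) else 0.0
              for x, e in zip(p, energies)]
    return lam, p_low, p_high


# Hypothetical example: geometric (thermal-like) populations on levels 0..199.
levels = list(range(200))
weights = [0.8 ** n for n in levels]
total = sum(weights)
rho = [w / total for w in weights]
energy = sum(x * e for x, e in zip(rho, levels))  # Tr(rho H), close to 4
delta = 0.25

lam, rho_low, rho_high = cutoff_split(rho, levels, energy / delta)
high_energy = sum(x * e for x, e in zip(rho_high, levels))

print("lambda =", lam, "<= delta =", delta)
print("lambda * Tr(rho_> H) =", lam * high_energy, "<= E =", energy)
# For diagonal states the entropy splits exactly along the cutoff.
print(shannon(rho),
      h(lam) + (1.0 - lam) * shannon(rho_low) + lam * shannon(rho_high))
```

The first two printed lines correspond to the inequalities (9); the last line shows that for commuting states the entropy of the pinched state decomposes exactly into $h(\lambda)$ plus the weighted entropies of the two pieces.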






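Proof of Lemma 16. First of all, by concavity of the
entropy (monotonicity under unital cptp maps),

\[
S(\rho) \le S(\mathcal{T}(\rho))
 = h(\lambda) + (1-\lambda)S(\rho_<) + \lambda S(\rho_>). \tag{10}
\]

Now, by Eq. (9), the maximum entropy principle and
Corollary 12,

\[
\lambda S(\rho_>) \le \lambda S(\gamma(E/\lambda)) \le \delta S(\gamma(E/\delta)).
\]

Thus, from Eq. (10), observing $\lambda \le \delta \le \frac12$, we get

\[
S(\rho) \le S(\rho_<) + h(\delta) + \delta S(\gamma(E/\delta)), \tag{11}
\]

and likewise for $\sigma$.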

Second, we have

\[
S(\sigma) \ge (1-\mu)S(\sigma_<) + \mu S(\sigma_>). \tag{12}
\]

To see this, we think of the action of $\mathcal{T}$ as a binary measurement
on the system $A$, which we can implement coherently
with two ancilla qubits $X$ and $X'$,

\[
|\varphi\rangle \longmapsto (P_<|\varphi\rangle)^A |00\rangle^{XX'} + (P_>|\varphi\rangle)^A |11\rangle^{XX'}.
\]

Applying this to $\sigma$, we have by unitary invariance and
the Araki-Lieb “triangle” inequality,

\[
\begin{aligned}
S(\sigma) = S(AXX') &\ge S(AX) - S(X') \\
 &= S(AX) - S(X) \\
 &= S(A|X) = (1-\mu)S(\sigma_<) + \mu S(\sigma_>).
\end{aligned}
\]

Thus, using that the energy of $\sigma_<$ is at most $E/\delta$ by
construction, and so $S(\sigma_<) \le S(\gamma(E/\delta))$,

\[
S(\sigma) \ge (1-\mu)S(\sigma_<) \ge S(\sigma_<) - \delta S(\gamma(E/\delta)). \tag{13}
\]

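Third, by definitions, contractivity of the trace norm
and triangle inequality,

\[
\begin{aligned}
2\varepsilon &\ge \|\rho - \sigma\|_1 \\
 &\ge \|P_<\rho P_< - P_<\sigma P_<\|_1 \\
 &= \|(1-\lambda)\rho_< - (1-\mu)\sigma_<\|_1 \\
 &= \|(1-\delta)(\rho_< - \sigma_<) + (\delta-\lambda)\rho_< + (\mu-\delta)\sigma_<\|_1 \\
 &\ge (1-\delta)\|\rho_< - \sigma_<\|_1 - 2\delta,
\end{aligned}
\]

and so

\[
\frac12\|\rho_< - \sigma_<\|_1 \le \frac{\varepsilon+\delta}{1-\delta} = \varepsilon'. \tag{14}
\]

Hence by the Fannes inequality in the form of Lemma 1,

\[
\begin{aligned}
|S(\rho_<) - S(\sigma_<)| &\le \varepsilon' \log \operatorname{Tr} P_< + h(\varepsilon') \\
 &\le \varepsilon' S(\gamma(E/\delta)) + h(\varepsilon'). 
\end{aligned} \tag{15}
\]

The latter inequality holds because the state $P_</\operatorname{Tr} P_<$
clearly has energy bounded by $E/\delta$, and so cannot have
entropy larger than the Gibbs state.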
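The trace norm estimate (14) can be checked numerically in a simplified setting. The sketch below assumes states that are diagonal in the energy eigenbasis of a single mode with levels $E_n = n$, so that the trace distance is half the $\ell_1$ distance of the populations; the two geometric distributions and the choice $\varepsilon' = 0.3$ are hypothetical.

```python
def half_l1(p, q):
    """Half the l1 distance, i.e. the trace distance of diagonal states."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))


def geometric(ratio, size):
    """Normalized geometric populations on levels 0..size-1."""
    weights = [ratio ** n for n in range(size)]
    total = sum(weights)
    return [w / total for w in weights]


def mean_energy(p, energies):
    """Expected energy of the diagonal state p."""
    return sum(x * e for x, e in zip(p, energies))


def truncate(p, energies, cutoff):
    """Normalized restriction of p to levels with energy <= cutoff."""
    kept = [x if e <= cutoff else 0.0 for x, e in zip(p, energies)]
    norm = sum(kept)
    return [x / norm for x in kept]


levels = list(range(60))
rho, sigma = geometric(0.5, 60), geometric(0.6, 60)
energy = max(mean_energy(rho, levels), mean_energy(sigma, levels))  # bound E

eps = half_l1(rho, sigma)          # about 0.11 for these populations
eps_prime = 0.3                    # any value with eps < eps' <= 1
delta = (eps_prime - eps) / (1.0 + eps_prime)

rho_low = truncate(rho, levels, energy / delta)
sigma_low = truncate(sigma, levels, energy / delta)
print(half_l1(rho_low, sigma_low), "<=", eps_prime)
```

The truncated and renormalized states stay within trace distance $\varepsilon'$ of each other, which is what allows the finite dimensional Fannes inequality to be applied on the support of $P_<$.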


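With these three elements we can conclude the proof:
W.l.o.g. $S(\rho) \ge S(\sigma)$, and so from Eqs. (11), (13) and
(15),

\[
\begin{aligned}
S(\rho) - S(\sigma) &\le S(\rho_<) - S(\sigma_<) + h(\delta) + 2\delta S(\gamma(E/\delta)) \\
 &\le (\varepsilon' + 2\delta)\, S(\gamma(E/\delta)) + h(\varepsilon') + h(\delta),
\end{aligned}
\]

as advertised. □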

Proof of Lemma 17. It is very similar to the previous
one, only that we have to be a bit more careful in some
details, as the conditional entropy can be negative.

The first step goes through almost unchanged, with the
map $\mathcal{T} \otimes \mathrm{id}_B$, since the conditional entropy is concave as
well (equivalent to strong subadditivity) [27]:

\[
\begin{aligned}
S(A|B)_\rho &\le S(A|B)_{\mathcal{T}(\rho)} \\
 &\le h(\lambda) + (1-\lambda)S(A|B)_{\rho_<} + \lambda S(A|B)_{\rho_>}.
\end{aligned}
\]

The remainder term $\lambda S(A|B)_{\rho_>}$ is upper bounded by
$\lambda S(\rho_>^A)$ (again by strong subadditivity), hence the upper
bound $\lambda S(\gamma(E/\lambda))$ still applies. The only change is due
to the fact that the conditional entropy can be negative.
However, for any bipartite state $\xi^{AB}$,

\[
-S(\xi^A) \le S(A|B)_\xi \le S(\xi^A).
\]

Here, the right hand inequality is strong subadditivity
that we have used before; introducing a purification
$|\psi\rangle^{ABC}$ of the state, we have $-S(A|B)_\psi = S(A|C)_\psi \le
S(\xi^A)$, which is the left hand inequality. Thus,

\[
(1-\lambda)S(A|B)_{\rho_<} \le S(A|B)_{\rho_<} + \delta S(\gamma(E/\delta)).
\]

Altogether,

\[
S(A|B)_\rho \le S(A|B)_{\rho_<} + 2\delta S(\gamma(E/\delta)) + h(\delta). \tag{16}
\]

Also the second step requires only minor modifications:
With the notation of the previous proof, and using the
Araki-Lieb “triangle” inequality once again,

\[
\begin{aligned}
S(AXX'|B) &= S(AXX'B) - S(B) \\
 &\ge S(ABX) - S(X') - S(B) \\
 &= S(ABX) - S(BX) - S(X) - S(B) + S(XB) \\
 &= S(A|BX) - I(X:B) \\
 &\ge S(A|BX) - h(\delta).
\end{aligned}
\]

Again, since conditional entropies can be negative, we
have to be more careful with remainder terms and get

\[
S(A|B)_\sigma \ge S(A|B)_{\sigma_<} - 2\delta S(\gamma(E/\delta)) - h(\delta). \tag{17}
\]

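In the third step, the trace norm estimate (14) goes
through unchanged, and then we apply the Alicki-Fannes
inequality in the form of Lemma 2:

\[
\begin{aligned}
|S(A|B)_{\rho_<} - S(A|B)_{\sigma_<}|
 &\le 2\varepsilon' \log \operatorname{Tr} P_< + (1+\varepsilon')\, h\!\left(\frac{\varepsilon'}{1+\varepsilon'}\right) \\
 &\le 2\varepsilon' S(\gamma(E/\delta)) + (1+\varepsilon')\, h\!\left(\frac{\varepsilon'}{1+\varepsilon'}\right).
\end{aligned}
\]

Putting this together with Eqs. (16) and (17), assuming
w.l.o.g. that $S(A|B)_\rho \ge S(A|B)_\sigma$, we obtain

\[
\begin{aligned}
S(A|B)_\rho - S(A|B)_\sigma &\le S(A|B)_{\rho_<} - S(A|B)_{\sigma_<} + 2h(\delta) + 4\delta S(\gamma(E/\delta)) \\
 &\le (2\varepsilon' + 4\delta)\, S(\gamma(E/\delta))
 + (1+\varepsilon')\, h\!\left(\frac{\varepsilon'}{1+\varepsilon'}\right) + 2h(\delta),
\end{aligned}
\]

and we are done. □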

The bounds of Lemmas 15, 16 and 17 are very general,
and it may not be immediately apparent how useful they
are. However, thanks to [13, Prop. 1(ii)], restated in
Remark 13, $\delta S(\gamma(E/\delta)) \to 0$ for every finite $E$, as $\delta \to 0$
(cf. [13, Cor. 4]). Thus, choosing $\varepsilon' = \sqrt{\varepsilon}$, the lemmas do
prove continuity of the entropy and conditional entropy
in general, and uniformly for each fixed energy.
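For a single oscillator mode this decay is easy to observe numerically. The sketch below assumes $\hbar\omega = 1$, so that $S(\gamma(E)) = (E+1)\log(E+1) - E\log E$ with base-2 logarithms, and tabulates $\delta S(\gamma(E/\delta))$ for a hypothetical energy $E = 1$; the function names are illustrative.

```python
import math


def g(n: float) -> float:
    """Gibbs entropy (bits) of one mode with mean excitation number n.

    Assumption: single oscillator, units with hbar * omega = 1.
    """
    if n <= 0.0:
        return 0.0
    return (n + 1.0) * math.log2(n + 1.0) - n * math.log2(n)


def scaled_gibbs_entropy(delta: float, energy: float = 1.0) -> float:
    """delta * S(gamma(E / delta)) for the single-mode oscillator."""
    return delta * g(energy / delta)


deltas = [10.0 ** (-k) for k in range(1, 7)]
values = [scaled_gibbs_entropy(d) for d in deltas]
for d, v in zip(deltas, values):
    print(f"delta = {d:.0e}   delta * S(gamma(E/delta)) = {v:.3e}")
```

The printed values decrease roughly like $\delta \log(1/\delta)$, which is the behaviour one expects since the Gibbs entropy of the oscillator grows only logarithmically with the energy.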

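We now specialize our bounds to the important case of
a collection of $\ell$ quantum harmonic oscillators, where we
shall see that the bounds are asymptotically tight. The
Hamiltonian is

\[
H = \sum_{i=1}^{\ell} \hbar\omega_i\, a_i^\dagger a_i, \tag{18}
\]

where $\omega_i$ is the native frequency of the $i$-th oscillator
and $a_i$ is its annihilation (aka lowering) operator (see
e.g. [25] or El). Note that we chose the slightly unusual
energy convention such that the ground state has energy
0, rather than $\frac12\hbar\omega_i$, to be able to apply directly our
above results. In the case of a single mode, and choosing
units such that $\hbar\omega_1 = 1$, the Hamiltonian simply becomes
the number operator $N$. In that case, it is well-known
that

\[
S(\gamma(N)) = g(N) := (N+1)\log(N+1) - N\log N
 \le \log(N+1) + \log e.
\]

Crucially, and in accordance with Proposition 11, $g$ is a
concave, monotone increasing function of $N$.

In the general case of Eq. (18), $\gamma(E) = \bigotimes_{i=1}^{\ell} \gamma_i(E_i)$,
with $E = \sum_i E_i$ and where $\gamma_i(E_i)$ is the Gibbs state of
the $i$-th mode with energy $E_i$. Maximizing the entropy
over all allocations of the total energy over the $\ell$ modes
leads to a transcendental equation, but we do not need
to solve it as we only want an upper bound, via $g(N) \le
\log(N+1) + \log e$. By a straightforward Lagrange multiplier
calculation we see that the optimum is to divide
the energy equally among the modes:

\[
\begin{aligned}
S(\gamma(E)) &\le \max_{\sum_i E_i = E}\ \sum_{i=1}^{\ell} \left[ \log\!\left(\frac{E_i}{\hbar\omega_i} + 1\right) + \log e \right] \\
 &= (\log e)\,\ell + \sum_{i=1}^{\ell} \log\!\left(\frac{\bar{E}}{\hbar\omega_i} + 1\right),
\end{aligned} \tag{19}
\]

with $E =: \ell\bar{E}$.

By using this upper bound in Lemmas 16 and 17 for
$\delta = \alpha\varepsilon(1-\varepsilon)$, with a parameter $\alpha$ between 0 and 1, and
introducing

\[
\tilde{h}(x) := \begin{cases} h(x) & \text{for } x \le \frac12, \\ 1 & \text{for } x \ge \frac12, \end{cases}
\]

we obtain directly the following:

Lemma 18 Consider two states $\rho$ and $\sigma$ of the $\ell$-oscillator
system whose energies are bounded,
$\operatorname{Tr}\rho H, \operatorname{Tr}\sigma H \le E = \ell\bar{E}$. Then, $\frac12\|\rho-\sigma\|_1 \le \varepsilon < 1$
implies

\[
\begin{aligned}
|S(\rho) - S(\sigma)| \le{}& \varepsilon\left(\frac{1+\alpha}{1-\alpha} + 2\alpha\right)
 \left[ \sum_{i=1}^{\ell} \log\!\left(\frac{\bar{E}}{\hbar\omega_i} + 1\right) + \ell \log\frac{e}{\alpha(1-\varepsilon)} \right] \\
 &+ (\ell+2)\left(\frac{1+\alpha}{1-\alpha} + 2\alpha\right) \tilde{h}\!\left(\frac{1+\alpha}{1-\alpha}\,\varepsilon\right).
\end{aligned}
\]

If the states live on a system composed of the $\ell$ oscillators
($A$) and another system $B$, then

\[
\begin{aligned}
|S(A|B)_\rho - S(A|B)_\sigma| \le{}& 2\varepsilon\left(\frac{1+\alpha}{1-\alpha} + 2\alpha\right)
 \left[ \sum_{i=1}^{\ell} \log\!\left(\frac{\bar{E}}{\hbar\omega_i} + 1\right) + \ell \log\frac{e}{\alpha(1-\varepsilon)} \right] \\
 &+ (2\ell+4)\left(\frac{1+\alpha}{1-\alpha} + 2\alpha\right) \tilde{h}\!\left(\frac{1+\alpha}{1-\alpha}\,\varepsilon\right).
\end{aligned}
\]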
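The single-mode estimate $g(N) \le \log(N+1) + \log e$ and the resulting upper bound (19) are easily checked numerically. The following sketch uses base-2 logarithms; the function names and the frequencies in the example are hypothetical, and the comparison with $\ell\, g(\bar{E}/\hbar\omega)$ assumes that all modes have the same frequency.

```python
import math

LOG2_E = math.log2(math.e)


def g(n: float) -> float:
    """Gibbs entropy (bits) of one mode with mean excitation number n."""
    if n <= 0.0:
        return 0.0
    return (n + 1.0) * math.log2(n + 1.0) - n * math.log2(n)


def gibbs_entropy_upper_bound(energy_per_mode, hbar_omegas):
    """Right hand side of Eq. (19):
    (log e) * l + sum_i log(E_bar / (hbar * omega_i) + 1)."""
    return (LOG2_E * len(hbar_omegas)
            + sum(math.log2(energy_per_mode / w + 1.0) for w in hbar_omegas))


grid = [0.5 * k for k in range(1, 101)]  # N = 0.5, 1.0, ..., 50.0

# g(N) <= log(N + 1) + log e on the grid.
assert all(g(n) <= math.log2(n + 1.0) + LOG2_E for n in grid)
# g is increasing and concave (non-positive second differences).
assert all(g(a) < g(b) for a, b in zip(grid, grid[1:]))
assert all(g(n - 0.5) + g(n + 0.5) <= 2.0 * g(n) for n in grid)

# Equal frequencies (assumption): the Gibbs entropy is l * g(E_bar / hbar omega).
modes = [2.0, 2.0, 2.0]
print(3 * g(5.0 / 2.0), "<=", gibbs_entropy_upper_bound(5.0, modes))
```

The gap between the two printed numbers is at most $\ell \log e$, which is the price of replacing $g$ by the simpler logarithmic expression.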


Remark 19 For each fixed $\varepsilon < 1$, we can make $\alpha$ arbitrarily
small, and then for large energy $\bar{E} \gg \hbar\omega_i$,
the bounds of Lemma 18 are asymptotically tight, in the
sense that apart from the additive offset terms, the factor
multiplying $\varepsilon$ ($2\varepsilon$, resp.) cannot be smaller than

\[
\sum_{i=1}^{\ell} \log\!\left(\frac{\bar{E}}{\hbar\omega_i} + 1\right).
\]

This can be seen in the entropy case by comparing the
vacuum state $\rho = |0\rangle\langle 0|^{\otimes\ell}$ of all $\ell$ modes with the state
$\sigma = (1-\varepsilon)|0\rangle\langle 0|^{\otimes\ell} + \varepsilon\gamma(E)$; in the conditional entropy
case, take $\rho$ to be a purification of the Gibbs state $\gamma(E)$
on $A \otimes B$, and $\sigma = (1-\varepsilon)\rho + \varepsilon\gamma(E)^A \otimes \tau^B$ with an
arbitrary state $\tau$ on $B$.


V. CONCLUSIONS 

Using entropy inequalities, specifically concavity, we
improved the appearance of the Alicki-Fannes inequality
for the conditional von Neumann entropy to an almost
tight form. It would be nice to know the ultimately best
form among all formulas that depend only on the dimension
of the Hilbert space and the trace distance, but we
have to leave this as an open problem.

In particular, it would be curious to find the optimal
form of the fidelity in Proposition [Sj

\[
F := \max F(\psi, \varphi) \ \text{ s.t. } \ \varphi^{A_1} = \rho,
\qquad F \ge 1 - \tfrac12\|\rho - \sigma\|_1,
\]

with a fixed purification $\psi$ of $\sigma$, and of Proposition 10

\[
1 - \tfrac12\|\rho - \sigma\|_1 \le \max \|\omega\|_\infty \ \text{ s.t. } \ \omega^{A_1} = \rho,\ \omega^{A_2} = \sigma,
\]

which may be regarded as quantum state analogues of
the coupling of random variables,

\[
\tfrac12\|p - q\|_1 = \min \Pr\{X \ne Y\} \ \text{ s.t. } \ X \sim p,\ Y \sim q.
\]

Furthermore, are there versions of these statements that
would allow for alternative proofs or tighter versions of
Lemmas 2 and 17 for the conditional entropy?
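For comparison, the classical coupling identity can be made concrete. The sketch below builds the standard maximal coupling of two finite distributions (a textbook construction, not taken from the propositions above) and verifies that its marginals are $p$ and $q$ and that $\Pr\{X \ne Y\} = \frac12\|p-q\|_1$. The function name and the example distributions are hypothetical.

```python
def maximal_coupling(p, q):
    """Joint distribution of (X, Y) with X ~ p, Y ~ q that minimizes
    Pr{X != Y}. Returns an n x n matrix joint[i][j] = Pr{X = i, Y = j}."""
    n = len(p)
    overlap = [min(a, b) for a, b in zip(p, q)]
    tv = 1.0 - sum(overlap)              # equals half the l1 distance
    joint = [[0.0] * n for _ in range(n)]
    for i in range(n):
        joint[i][i] = overlap[i]         # put as much mass as possible on X = Y
    if tv > 1e-15:
        # Spread the leftover mass as a product of the residuals; these have
        # disjoint supports, so nothing more is added to the diagonal.
        for i in range(n):
            for j in range(n):
                joint[i][j] += (p[i] - overlap[i]) * (q[j] - overlap[j]) / tv
    return joint


p = [0.5, 0.3, 0.2]
q = [0.2, 0.3, 0.5]
joint = maximal_coupling(p, q)

row_sums = [sum(row) for row in joint]                        # marginal of X
col_sums = [sum(row[j] for row in joint) for j in range(3)]   # marginal of Y
prob_differ = 1.0 - sum(joint[i][i] for i in range(3))
half_l1 = 0.5 * sum(abs(a - b) for a, b in zip(p, q))
print(row_sums, col_sums, prob_differ, half_l1)
```

The diagonal of the joint distribution carries the mass $\sum_i \min(p_i, q_i) = 1 - \frac12\|p-q\|_1$, which is the classical counterpart of the largest eigenvalue bound on $\omega$ in Proposition 10.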

The same principle led to the apparently first uniform
continuity bounds of the entropy and conditional entropy on
infinite dimensional Hilbert spaces under a bound on the
expected energy (or, for that matter, bounded expectation
of any sufficiently well-behaved Hermitian operator).
In the case of a system of harmonic oscillators, we have
seen that the bound is, in a certain sense, asymptotically
tight, even though here we are much farther away from
a universally optimal form.

The Fannes and Alicki-Fannes inequalities are already
known to have many applications in quantum information
theory. These include the continuity of certain
entanglement measures such as entanglement of formation
[23], relative entropy of entanglement [HJ , squashed
entanglement [7] and conditional entanglement of mutual
information [541 ] , and of various quantum channel capacities
[2(|. In fact, we always get explicit continuity bounds
in terms of the trace distance of the states or diamond
norm distance of the channels, respectively. While in
many applications it is of minor interest to have the optimal
form of the bound (for example when $\varepsilon$ goes to 0),
it pays off to have a tighter bound than [1] in the setting
of approximately degradable channels [43]. Indeed, this
results even in new, tighter upper bounds on the quantum
capacity of very quiet depolarizing channels [43], by
way of an extension of the methodology of [ilj] .

The infinite dimensional versions of these entropy
bounds under an energy constraint are awaiting applications,
though it seems clear that explicit bounds on
the continuity and asymptotic continuity of entanglement
measures [12] (e.g. for squashed entanglement since the
first posting of the present manuscript 0 ) and channel
capacities [1711 1 ?l . 1361 ] in infinite dimension should be
among the first, as well as the extension of approximate
degradability 0] to Bosonic channels 0].